Dataset info
| Number of variables | 182 |
|---|---|
| Number of observations | 555 |
| Missing cells | 67353 (66.7%) |
| Duplicate rows | 0 (0.0%) |
| Total size in memory | 736.1 KiB |
| Average record size in memory | 1.3 KiB |
Variables types
| Numeric | 30 |
|---|---|
| Categorical | 23 |
| Boolean | 33 |
| Date | 0 |
| URL | 0 |
| Text (Unique) | 1 |
| Rejected | 95 |
| Unsupported | 0 |
Warnings
coligada_mais_antiga_ativa has 493 (88.8%) missing values | Missing |
coligada_mais_antiga_baixada has constant value "nan" | Rejected |
coligada_mais_nova_ativa has 493 (88.8%) missing values | Missing |
coligada_mais_nova_baixada has constant value "nan" | Rejected |
de_faixa_faturamento_estimado has 25 (4.5%) missing values | Missing |
de_faixa_faturamento_estimado_grupo has 25 (4.5%) missing values | Missing |
de_indicador_telefone has 501 (90.3%) missing values | Missing |
de_saude_rescencia has 11 (2.0%) missing values | Missing |
de_saude_tributaria has 11 (2.0%) missing values | Missing |
dt_situacao only contains datetime values, but is categorical. Consider applying pd.to_datetime() | Type |
dt_situacao has a high cardinality: 416 distinct values | Warning |
empsetorcensitariofaixarendapopulacao has 150 (27.0%) missing values | Missing |
faturamento_est_coligados has 493 (88.8%) missing values | Missing |
faturamento_est_coligados_gp is highly correlated with faturamento_est_coligados (ρ = 0.9798738459) | Rejected |
fl_epp has constant value "False" | Rejected |
fl_ltda has constant value "False" | Rejected |
fl_optante_simei has 101 (18.2%) missing values | Missing |
fl_optante_simples has 101 (18.2%) missing values | Missing |
fl_simples_irregular has constant value "False" | Rejected |
fl_spa has constant value "False" | Rejected |
fl_st_especial has constant value "False" | Rejected |
grau_instrucao_macro_analfabeto has 554 (99.8%) missing values | Missing |
grau_instrucao_macro_desconhecido has constant value "nan" | Rejected |
grau_instrucao_macro_escolaridade_fundamental has 530 (95.5%) missing values | Missing |
grau_instrucao_macro_escolaridade_media has 469 (84.5%) missing values | Missing |
grau_instrucao_macro_escolaridade_superior has 532 (95.9%) missing values | Missing |
idade_acima_de_58 is highly correlated with grau_instrucao_macro_escolaridade_fundamental (ρ = 0.9173649446) | Rejected |
idade_ate_18 has 551 (99.3%) missing values | Missing |
idade_de_19_a_23 is highly correlated with idade_acima_de_58 (ρ = 0.9432422183) | Rejected |
idade_de_24_a_28 has 509 (91.7%) missing values | Missing |
idade_de_29_a_33 is highly correlated with grau_instrucao_macro_escolaridade_media (ρ = 0.9270644425) | Rejected |
idade_de_34_a_38 has 513 (92.4%) missing values | Missing |
idade_de_39_a_43 has 523 (94.2%) missing values | Missing |
idade_de_44_a_48 has 534 (96.2%) missing values | Missing |
idade_de_49_a_53 is highly correlated with idade_de_29_a_33 (ρ = 0.9490484453) | Rejected |
idade_de_54_a_58 is highly correlated with idade_de_34_a_38 (ρ = 0.9779514754) | Rejected |
idade_maxima_coligadas is highly correlated with coligada_mais_antiga_ativa (ρ = 1) | Rejected |
idade_maxima_socios has 193 (34.8%) missing values | Missing |
idade_media_coligadas has 492 (88.6%) missing values | Missing |
idade_media_coligadas_ativas is highly correlated with idade_media_coligadas (ρ = 1) | Rejected |
idade_media_coligadas_baixadas has constant value "nan" | Rejected |
idade_media_socios is highly correlated with idade_maxima_socios (ρ = 0.9663386807) | Rejected |
idade_minima_coligadas is highly correlated with coligada_mais_nova_ativa (ρ = 1) | Rejected |
idade_minima_socios is highly correlated with idade_media_socios (ρ = 0.9672309258) | Rejected |
max_faturamento_est_coligados is highly correlated with faturamento_est_coligados_gp (ρ = 0.9753288607) | Rejected |
max_faturamento_est_coligados_gp is highly correlated with max_faturamento_est_coligados (ρ = 0.9208848359) | Rejected |
max_filiais_coligados has 540 (97.3%) missing values | Missing |
max_funcionarios_coligados_gp is highly correlated with max_faturamento_est_coligados_gp (ρ = 0.9988162351) | Rejected |
max_meses_servicos has 454 (81.8%) missing values | Missing |
max_meses_servicos_all has 424 (76.4%) missing values | Missing |
max_vl_folha_coligados is highly correlated with max_faturamento_est_coligados (ρ = 0.9432797848) | Rejected |
max_vl_folha_coligados_gp is highly correlated with max_funcionarios_coligados_gp (ρ = 0.9830355726) | Rejected |
media_faturamento_est_coligados is highly correlated with max_funcionarios_coligados_gp (ρ = 0.9460011571) | Rejected |
media_faturamento_est_coligados_gp has 493 (88.8%) missing values | Missing |
media_filiais_coligados is highly correlated with max_filiais_coligados (ρ = 0.984817247) | Rejected |
media_funcionarios_coligados_gp is highly correlated with media_filiais_coligados (ρ = 0.9290150933) | Rejected |
media_meses_servicos has 454 (81.8%) missing values | Missing |
media_meses_servicos_all is highly correlated with max_meses_servicos_all (ρ = 0.9651590156) | Rejected |
media_vl_folha_coligados is highly correlated with media_faturamento_est_coligados (ρ = 0.9023282866) | Rejected |
media_vl_folha_coligados_gp is highly correlated with media_funcionarios_coligados_gp (ρ = 0.9956423815) | Rejected |
meses_ultima_contratacaco has 424 (76.4%) missing values | Missing |
min_faturamento_est_coligados is highly correlated with media_faturamento_est_coligados_gp (ρ = 0.9401087019) | Rejected |
min_faturamento_est_coligados_gp is highly correlated with min_faturamento_est_coligados (ρ = 0.9857654776) | Rejected |
min_filiais_coligados is highly correlated with media_vl_folha_coligados_gp (ρ = 0.9113088103) | Rejected |
min_funcionarios_coligados_gp is highly correlated with min_faturamento_est_coligados_gp (ρ = 0.9974669262) | Rejected |
min_meses_servicos has 454 (81.8%) missing values | Missing |
min_meses_servicos_all has 424 (76.4%) missing values | Missing |
min_vl_folha_coligados is highly correlated with min_faturamento_est_coligados (ρ = 0.9115581462) | Rejected |
min_vl_folha_coligados_gp is highly correlated with min_funcionarios_coligados_gp (ρ = 0.9997894656) | Rejected |
nm_meso_regiao has 64 (11.5%) missing values | Missing |
nm_micro_regiao has a high cardinality: 67 distinct values | Warning |
nm_micro_regiao has 64 (11.5%) missing values | Missing |
nu_meses_rescencia has 47 (8.5%) missing values | Missing |
percent_func_genero_fem has 22 (4.0%) zeros | Zeros |
percent_func_genero_fem has 454 (81.8%) missing values | Missing |
percent_func_genero_masc has 32 (5.8%) zeros | Zeros |
percent_func_genero_masc has 454 (81.8%) missing values | Missing |
qt_admitidos has 424 (76.4%) missing values | Missing |
qt_admitidos_12meses has 89 (16.0%) zeros | Zeros |
qt_admitidos_12meses has 424 (76.4%) missing values | Missing |
qt_alteracao_socio_180d has constant value "nan" | Rejected |
qt_alteracao_socio_365d has constant value "nan" | Rejected |
qt_alteracao_socio_90d has constant value "nan" | Rejected |
qt_alteracao_socio_total has constant value "nan" | Rejected |
qt_art is highly correlated with qt_admitidos (ρ = 1) | Rejected |
qt_coligadas is highly correlated with min_filiais_coligados (ρ = 0.9682511623) | Rejected |
qt_coligados is highly correlated with qt_coligadas (ρ = 0.9974819746) | Rejected |
qt_coligados_agropecuaria has 492 (88.6%) missing values | Missing |
qt_coligados_atividade_alto has 492 (88.6%) missing values | Missing |
qt_coligados_atividade_baixo has 492 (88.6%) missing values | Missing |
qt_coligados_atividade_inativo has 492 (88.6%) missing values | Missing |
qt_coligados_atividade_medio has 492 (88.6%) missing values | Missing |
qt_coligados_atividade_mt_baixo has 492 (88.6%) missing values | Missing |
qt_coligados_ativo has 492 (88.6%) missing values | Missing |
qt_coligados_baixada has 492 (88.6%) missing values | Missing |
qt_coligados_ccivil is highly correlated with qt_art (ρ = 1) | Rejected |
qt_coligados_centro is highly correlated with qt_coligadas (ρ = 0.9142772641) | Rejected |
qt_coligados_comercio has 21 (3.8%) zeros | Zeros |
qt_coligados_comercio has 492 (88.6%) missing values | Missing |
qt_coligados_epp has 492 (88.6%) missing values | Missing |
qt_coligados_exterior has 492 (88.6%) missing values | Missing |
qt_coligados_inapta has 492 (88.6%) missing values | Missing |
qt_coligados_industria is highly correlated with qt_coligados_ccivil (ρ = 0.9011165757) | Rejected |
qt_coligados_ltda is highly correlated with qt_coligados_centro (ρ = 1) | Rejected |
qt_coligados_matriz is highly correlated with qt_coligados (ρ = 0.991544335) | Rejected |
qt_coligados_me has 492 (88.6%) missing values | Missing |
qt_coligados_mei has 492 (88.6%) missing values | Missing |
qt_coligados_nordeste is highly correlated with qt_coligados_ativo (ρ = 0.9006896596) | Rejected |
qt_coligados_norte is highly correlated with qt_art (ρ = 1) | Rejected |
qt_coligados_nula has 492 (88.6%) missing values | Missing |
qt_coligados_sa is highly correlated with qt_coligados_industria (ρ = 0.9802385571) | Rejected |
qt_coligados_serviço is highly correlated with max_vl_folha_coligados (ρ = 0.9134551877) | Rejected |
qt_coligados_sudeste is highly correlated with qt_coligados_ltda (ρ = 0.9443641162) | Rejected |
qt_coligados_sul has 492 (88.6%) missing values | Missing |
qt_coligados_suspensa has 492 (88.6%) missing values | Missing |
qt_desligados is highly correlated with qt_art (ρ = 1) | Rejected |
qt_desligados_12meses is highly correlated with qt_art (ρ = 1) | Rejected |
qt_ex_funcionarios is highly correlated with qt_desligados (ρ = 0.9999417874) | Rejected |
qt_filiais is highly correlated with qt_coligados_sa (ρ = 0.9757969605) | Rejected |
qt_funcionarios is highly correlated with idade_de_49_a_53 (ρ = 0.9326400842) | Rejected |
qt_funcionarios_12meses is highly correlated with qt_funcionarios (ρ = 0.9757958364) | Rejected |
qt_funcionarios_24meses is highly correlated with qt_admitidos (ρ = 0.9220046051) | Rejected |
qt_funcionarios_coligados is highly correlated with qt_filiais (ρ = 0.9495362911) | Rejected |
qt_funcionarios_coligados_gp is highly correlated with qt_funcionarios_coligados (ρ = 0.971940039) | Rejected |
qt_funcionarios_grupo is highly correlated with qt_funcionarios_coligados_gp (ρ = 0.9157758137) | Rejected |
qt_ramos_coligados is highly correlated with qt_funcionarios_grupo (ρ = 0.966877865) | Rejected |
qt_regioes_coligados is highly correlated with qt_coligadas (ρ = 0.9142772641) | Rejected |
qt_socios has 147 (26.5%) missing values | Missing |
qt_socios_coligados is highly correlated with qt_coligados_ativo (ρ = 0.9824935146) | Rejected |
qt_socios_feminino has 365 (65.8%) missing values | Missing |
qt_socios_masculino is highly correlated with qt_art (ρ = 1) | Rejected |
qt_socios_pep is highly correlated with qt_socios_masculino (ρ = 1) | Rejected |
qt_socios_pf is highly correlated with qt_socios_pep (ρ = 1) | Rejected |
qt_socios_pj has 147 (26.5%) missing values | Missing |
qt_socios_pj_ativos is highly correlated with qt_socios_pj (ρ = 1) | Rejected |
qt_socios_pj_baixados has 553 (99.6%) missing values | Missing |
qt_socios_pj_inaptos has 553 (99.6%) missing values | Missing |
qt_socios_pj_nulos has 553 (99.6%) missing values | Missing |
qt_socios_pj_suspensos has 553 (99.6%) missing values | Missing |
qt_socios_st_regular is highly correlated with qt_socios_pep (ρ = 1) | Rejected |
qt_socios_st_suspensa has 548 (98.7%) missing values | Missing |
qt_ufs_coligados is highly correlated with qt_socios_pj_ativos (ρ = 1) | Rejected |
sum_faturamento_estimado_coligadas is highly correlated with qt_regioes_coligados (ρ = 0.9794843167) | Rejected |
total is highly correlated with qt_funcionarios_12meses (ρ = 0.9743791866) | Rejected |
total_filiais_coligados has 540 (97.3%) missing values | Missing |
tx_crescimento_12meses has 63 (11.4%) zeros | Zeros |
tx_crescimento_12meses has 458 (82.5%) missing values | Missing |
tx_crescimento_24meses is highly correlated with min_vl_folha_coligados_gp (ρ = 0.9675760839) | Rejected |
tx_rotatividade is highly correlated with qt_art (ρ = 1) | Rejected |
Unnamed_0 is highly correlated with qt_socios_pj_ativos (ρ = 1) | Rejected |
vl_faturamento_estimado_aux is highly correlated with qt_socios_pj_ativos (ρ = 1) | Rejected |
vl_faturamento_estimado_grupo_aux is highly correlated with qt_socios_pj_ativos (ρ = 1) | Rejected |
vl_folha_coligados is highly correlated with vl_faturamento_estimado_grupo_aux (ρ = 0.9801776077) | Rejected |
vl_folha_coligados_gp is highly correlated with vl_folha_coligados (ρ = 0.9769040844) | Rejected |
vl_frota is highly correlated with total_filiais_coligados (ρ = 1) | Rejected |
vl_idade_maxima_socios_pj is highly correlated with vl_faturamento_estimado_grupo_aux (ρ = 1) | Rejected |
vl_idade_media_socios_pj is highly correlated with nu_meses_rescencia (ρ = 1) | Rejected |
vl_idade_minima_socios_pj is highly correlated with vl_idade_media_socios_pj (ρ = 1) | Rejected |
vl_potenc_cons_oleo_gas is highly correlated with tx_crescimento_24meses (ρ = 1) | Rejected |
vl_total_tancagem has constant value "nan" | Rejected |
vl_total_tancagem_grupo has constant value "nan" | Rejected |
vl_total_veiculos_antt has constant value "nan" | Rejected |
vl_total_veiculos_antt_grupo has constant value "nan" | Rejected |
vl_total_veiculos_leves is highly correlated with total_filiais_coligados (ρ = 1) | Rejected |
vl_total_veiculos_leves_grupo is highly correlated with vl_idade_maxima_socios_pj (ρ = 1) | Rejected |
vl_total_veiculos_pesados is highly correlated with vl_potenc_cons_oleo_gas (ρ = 1) | Rejected |
vl_total_veiculos_pesados_grupo is highly correlated with vl_total_veiculos_leves_grupo (ρ = 0.9945105446) | Rejected |
coligada_mais_antiga_ativa
Numeric
| Distinct count | 62 |
|---|---|
| Unique (%) | 11.2% |
| Missing (%) | 88.8% |
| Missing (n) | 493 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 218.4887097 |
|---|---|
| Minimum | 4.066666667 |
| Maximum | 636.6 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 4.066666667 |
|---|---|
| 5-th percentile | 19.97 |
| Q1 | 90.18333333 |
| Median | 191.1 |
| Q3 | 325.1333333 |
| 95-th percentile | 521.6433333 |
| Maximum | 636.6 |
| Range | 632.5333333 |
| Interquartile range | 234.95 |
Descriptive statistics
| Standard deviation | 165.7745939 |
|---|---|
| Coef of variation | 0.7587329989 |
| Kurtosis | 0.07860743476 |
| Mean | 218.4887097 |
| MAD | 130.713042 |
| Skewness | 0.883628313 |
| Sum | 13546.3 |
| Variance | 27481.21599 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 27.73333333 | 2 | 0.4% | |
| 10.16666667 | 1 | 0.2% | |
| 521.8 | 1 | 0.2% | |
| 354.5333333 | 1 | 0.2% | |
| 87.7 | 1 | 0.2% | |
| 399.1666667 | 1 | 0.2% | |
| 383.2 | 1 | 0.2% | |
| 103.2666667 | 1 | 0.2% | |
| 93.73333333 | 1 | 0.2% | |
| 135.4666667 | 1 | 0.2% | |
| Other values (51) | 51 | 9.2% | |
| (Missing) | 493 | 88.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 4.066666667 | 1 | 0.2% | |
| 9.066666667 | 1 | 0.2% | |
| 10.16666667 | 1 | 0.2% | |
| 19.56666667 | 1 | 0.2% | |
| 27.63333333 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 636.6 | 1 | 0.2% | |
| 634.1666667 | 1 | 0.2% | |
| 611.2 | 1 | 0.2% | |
| 521.8 | 1 | 0.2% | |
| 518.6666667 | 1 | 0.2% |
coligada_mais_antiga_baixada
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
coligada_mais_nova_ativa
Numeric
| Distinct count | 62 |
|---|---|
| Unique (%) | 11.2% |
| Missing (%) | 88.8% |
| Missing (n) | 493 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 121.3016129 |
|---|---|
| Minimum | 1.566666667 |
| Maximum | 476.0666667 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.566666667 |
|---|---|
| 5-th percentile | 5.041666667 |
| Q1 | 24.35833333 |
| Median | 91.36666667 |
| Q3 | 194.325 |
| 95-th percentile | 367.1716667 |
| Maximum | 476.0666667 |
| Range | 474.5 |
| Interquartile range | 169.9666667 |
Descriptive statistics
| Standard deviation | 117.362627 |
|---|---|
| Coef of variation | 0.9675273408 |
| Kurtosis | 1.343809131 |
| Mean | 121.3016129 |
| MAD | 91.36531391 |
| Skewness | 1.286324323 |
| Sum | 7520.7 |
| Variance | 13773.98621 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 27.73333333 | 2 | 0.4% | |
| 125.6333333 | 1 | 0.2% | |
| 16.23333333 | 1 | 0.2% | |
| 1.566666667 | 1 | 0.2% | |
| 93.73333333 | 1 | 0.2% | |
| 135.4666667 | 1 | 0.2% | |
| 476.0666667 | 1 | 0.2% | |
| 225.0333333 | 1 | 0.2% | |
| 10.36666667 | 1 | 0.2% | |
| 194.3 | 1 | 0.2% | |
| Other values (51) | 51 | 9.2% | |
| (Missing) | 493 | 88.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.566666667 | 1 | 0.2% | |
| 4.066666667 | 1 | 0.2% | |
| 4.633333333 | 1 | 0.2% | |
| 4.866666667 | 1 | 0.2% | |
| 8.366666667 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 476.0666667 | 1 | 0.2% | |
| 450.0666667 | 1 | 0.2% | |
| 426.4333333 | 1 | 0.2% | |
| 368.2666667 | 1 | 0.2% | |
| 346.3666667 | 1 | 0.2% |
coligada_mais_nova_baixada
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
de_faixa_faturamento_estimado
Categorical
| Distinct count | 8 |
|---|---|
| Unique (%) | 1.4% |
| Missing (%) | 4.5% |
| Missing (n) | 25 |
| DE R$ 81.000,01 A R$ 360.000,00 | |
|---|---|
| ATE R$ 81.000,00 | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 41 |
| Other values (4) | 20 |
| (Missing) | 25 |
| Value | Count | Frequency (%) | |
| DE R$ 81.000,01 A R$ 360.000,00 | 325 | 58.6% | |
| ATE R$ 81.000,00 | 144 | 25.9% | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 41 | 7.4% | |
| DE R$ 1.500.000,01 A R$ 4.800.000,00 | 16 | 2.9% | |
| SEM INFORMACAO | 2 | 0.4% | |
| DE R$ 10.000.000,01 A R$ 30.000.000,00 | 1 | 0.2% | |
| DE R$ 4.800.000,01 A R$ 10.000.000,00 | 1 | 0.2% | |
| (Missing) | 25 | 4.5% |
| Max length | 38 |
|---|---|
| Mean length | 26.17477477 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_faixa_faturamento_estimado_grupo
Categorical
| Distinct count | 10 |
|---|---|
| Unique (%) | 1.8% |
| Missing (%) | 4.5% |
| Missing (n) | 25 |
| DE R$ 81.000,01 A R$ 360.000,00 | |
|---|---|
| ATE R$ 81.000,00 | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 58 |
| Other values (6) | 33 |
| (Missing) | 25 |
| Value | Count | Frequency (%) | |
| DE R$ 81.000,01 A R$ 360.000,00 | 296 | 53.3% | |
| ATE R$ 81.000,00 | 143 | 25.8% | |
| DE R$ 360.000,01 A R$ 1.500.000,00 | 58 | 10.5% | |
| DE R$ 1.500.000,01 A R$ 4.800.000,00 | 18 | 3.2% | |
| DE R$ 4.800.000,01 A R$ 10.000.000,00 | 6 | 1.1% | |
| DE R$ 10.000.000,01 A R$ 30.000.000,00 | 5 | 0.9% | |
| DE R$ 30.000.000,01 A R$ 100.000.000,00 | 2 | 0.4% | |
| DE R$ 300.000.000,01 A R$ 500.000.000,00 | 1 | 0.2% | |
| ACIMA DE 1 BILHAO DE REAIS | 1 | 0.2% | |
| (Missing) | 25 | 4.5% |
| Max length | 40 |
|---|---|
| Mean length | 26.51351351 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_indicador_telefone
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 90.3% |
| Missing (n) | 501 |
| BOA | 54 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| BOA | 54 | 9.7% | |
| (Missing) | 501 | 90.3% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
de_natureza_juridica
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 1.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| EMPRESARIO INDIVIDUAL | |
|---|---|
| SOCIEDADE EMPRESARIA LIMITADA | |
| EMPRESA INDIVIDUAL DE RESPONSABILIDADE LIMITADA DE NATUREZA EMPRESARIA | 15 |
| Other values (3) | 3 |
| Value | Count | Frequency (%) | |
| EMPRESARIO INDIVIDUAL | 432 | 77.8% | |
| SOCIEDADE EMPRESARIA LIMITADA | 105 | 18.9% | |
| EMPRESA INDIVIDUAL DE RESPONSABILIDADE LIMITADA DE NATUREZA EMPRESARIA | 15 | 2.7% | |
| SOCIEDADE EMPRESARIA EM NOME COLETIVO | 1 | 0.2% | |
| SOCIEDADE EM CONTA DE PARTICIPACAO | 1 | 0.2% | |
| SOCIEDADE ANONIMA FECHADA | 1 | 0.2% |
| Max length | 70 |
|---|---|
| Mean length | 23.8972973 |
| Min length | 21 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_nivel_atividade
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 0.7% |
| Missing (n) | 4 |
| MEDIA | |
|---|---|
| ALTA | |
| BAIXA | |
| (Missing) | 4 |
| Value | Count | Frequency (%) | |
| MEDIA | 280 | 50.5% | |
| ALTA | 153 | 27.6% | |
| BAIXA | 116 | 20.9% | |
| MUITO BAIXA | 2 | 0.4% | |
| (Missing) | 4 | 0.7% |
| Max length | 11 |
|---|---|
| Mean length | 4.731531532 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_ramo
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| COMERCIO VAREJISTA | |
|---|---|
| BENS DE CONSUMO | 59 |
| Value | Count | Frequency (%) | |
| COMERCIO VAREJISTA | 496 | 89.4% | |
| BENS DE CONSUMO | 59 | 10.6% |
| Max length | 18 |
|---|---|
| Mean length | 17.68108108 |
| Min length | 15 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
de_saude_rescencia
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 2.0% |
| Missing (n) | 11 |
| ACIMA DE 1 ANO | |
|---|---|
| ATE 1 ANO | 55 |
| SEM INFORMACAO | 36 |
| (Missing) | 11 |
| Value | Count | Frequency (%) | |
| ACIMA DE 1 ANO | 453 | 81.6% | |
| ATE 1 ANO | 55 | 9.9% | |
| SEM INFORMACAO | 36 | 6.5% | |
| (Missing) | 11 | 2.0% |
| Max length | 14 |
|---|---|
| Mean length | 13.28648649 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
de_saude_tributaria
Categorical
| Distinct count | 7 |
|---|---|
| Unique (%) | 1.3% |
| Missing (%) | 2.0% |
| Missing (n) | 11 |
| VERDE | |
|---|---|
| AZUL | |
| AMARELO | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| VERDE | 167 | 30.1% | |
| AZUL | 128 | 23.1% | |
| AMARELO | 113 | 20.4% | |
| CINZA | 86 | 15.5% | |
| LARANJA | 47 | 8.5% | |
| VERMELHO | 3 | 0.5% | |
| (Missing) | 11 | 2.0% |
| Max length | 8 |
|---|---|
| Mean length | 5.322522523 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
dt_situacao
Categorical
| Distinct count | 416 |
|---|---|
| Unique (%) | 75.0% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 2005-11-03 | |
|---|---|
| 2010-10-26 | 4 |
| 2017-02-21 | 3 |
| Other values (413) |
| Value | Count | Frequency (%) | |
| 2005-11-03 | 100 | 18.0% | |
| 2010-10-26 | 4 | 0.7% | |
| 2017-02-21 | 3 | 0.5% | |
| 2006-12-02 | 3 | 0.5% | |
| 2017-03-29 | 3 | 0.5% | |
| 2003-10-04 | 2 | 0.4% | |
| 2010-05-15 | 2 | 0.4% | |
| 2013-08-26 | 2 | 0.4% | |
| 2004-10-30 | 2 | 0.4% | |
| 2018-08-16 | 2 | 0.4% | |
| Other values (406) | 432 | 77.8% |
| Max length | 10 |
|---|---|
| Mean length | 10 |
| Min length | 10 |
| Contains chars | False |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
empsetorcensitariofaixarendapopulacao
Numeric
| Distinct count | 392 |
|---|---|
| Unique (%) | 70.6% |
| Missing (%) | 27.0% |
| Missing (n) | 150 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1130.532765 |
|---|---|
| Minimum | 110.3 |
| Maximum | 6201.91 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 110.3 |
|---|---|
| 5-th percentile | 401.95 |
| Q1 | 643.07 |
| Median | 905.94 |
| Q3 | 1303.82 |
| 95-th percentile | 2460.026 |
| Maximum | 6201.91 |
| Range | 6091.61 |
| Interquartile range | 660.75 |
Descriptive statistics
| Standard deviation | 815.0382028 |
|---|---|
| Coef of variation | 0.7209328449 |
| Kurtosis | 9.551738645 |
| Mean | 1130.532765 |
| MAD | 537.2669602 |
| Skewness | 2.705259278 |
| Sum | 457865.77 |
| Variance | 664287.272 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 555.9 | 2 | 0.4% | |
| 1316.48 | 2 | 0.4% | |
| 2361.38 | 2 | 0.4% | |
| 949.29 | 2 | 0.4% | |
| 609.88 | 2 | 0.4% | |
| 4640.24 | 2 | 0.4% | |
| 1660.52 | 2 | 0.4% | |
| 946.33 | 2 | 0.4% | |
| 1461.93 | 2 | 0.4% | |
| 2019.39 | 2 | 0.4% | |
| Other values (381) | 385 | 69.4% | |
| (Missing) | 150 | 27.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 110.3 | 1 | 0.2% | |
| 268.85 | 1 | 0.2% | |
| 284.92 | 1 | 0.2% | |
| 285.63 | 1 | 0.2% | |
| 305.93 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 6201.91 | 1 | 0.2% | |
| 5049.8 | 1 | 0.2% | |
| 4815.33 | 1 | 0.2% | |
| 4796.69 | 1 | 0.2% | |
| 4655.76 | 1 | 0.2% |
faturamento_est_coligados
Numeric
| Distinct count | 42 |
|---|---|
| Unique (%) | 7.6% |
| Missing (%) | 88.8% |
| Missing (n) | 493 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 23987448.6 |
|---|---|
| Minimum | 61819.19922 |
| Maximum | 822198302.8 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 61819.19922 |
|---|---|
| 5-th percentile | 210000 |
| Q1 | 210000 |
| Median | 605457.5938 |
| Q3 | 1985474.438 |
| 95-th percentile | 33466988.08 |
| Maximum | 822198302.8 |
| Range | 822136483.6 |
| Interquartile range | 1775474.438 |
Descriptive statistics
| Standard deviation | 119192027 |
|---|---|
| Coef of variation | 4.968933088 |
| Kurtosis | 36.80524454 |
| Mean | 23987448.6 |
| MAD | 41381008.51 |
| Skewness | 5.974394413 |
| Sum | 1487221813 |
| Variance | 1.420673931e+16 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 210000 | 14 | 2.5% | |
| 420000 | 5 | 0.9% | |
| 370915.1875 | 3 | 0.5% | |
| 457276.7969 | 2 | 0.4% | |
| 556372.8125 | 2 | 0.4% | |
| 10929264 | 1 | 0.2% | |
| 630000 | 1 | 0.2% | |
| 5811005 | 1 | 0.2% | |
| 8820000 | 1 | 0.2% | |
| 1458191.992 | 1 | 0.2% | |
| Other values (31) | 31 | 5.6% | |
| (Missing) | 493 | 88.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 61819.19922 | 1 | 0.2% | |
| 185457.5938 | 1 | 0.2% | |
| 206064 | 1 | 0.2% | |
| 210000 | 14 | 2.5% | |
| 370915.1875 | 3 | 0.5% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 822198302.8 | 1 | 0.2% | |
| 470915908.8 | 1 | 0.2% | |
| 51000840 | 1 | 0.2% | |
| 34646006.56 | 1 | 0.2% | |
| 11065636.88 | 1 | 0.2% |
faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.9798738459 |
|---|
fl_antt
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 4 |
| Value | Count | Frequency (%) | |
| False | 551 | 99.3% | |
| True | 4 | 0.7% |
fl_email
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) | |
| False | 300 | 54.1% | |
| True | 255 | 45.9% |
fl_epp
Constant
This variable is constant and should be ignored for analysis
| Constant value | False |
|---|
fl_ltda
Constant
This variable is constant and should be ignored for analysis
| Constant value | False |
|---|
fl_matriz
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| True | |
|---|---|
| False | 36 |
| Value | Count | Frequency (%) | |
| True | 519 | 93.5% | |
| False | 36 | 6.5% |
fl_me
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 2 |
| Value | Count | Frequency (%) | |
| False | 553 | 99.6% | |
| True | 2 | 0.4% |
fl_mei
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) | |
| False | 337 | 60.7% | |
| True | 218 | 39.3% |
fl_optante_simei
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 18.2% |
| Missing (n) | 101 |
| False | |
|---|---|
| True | |
| (Missing) |
| Value | Count | Frequency (%) | |
| False | 316 | 56.9% | |
| True | 138 | 24.9% | |
| (Missing) | 101 | 18.2% |
fl_optante_simples
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 18.2% |
| Missing (n) | 101 |
| True | |
|---|---|
| False | |
| (Missing) |
| Value | Count | Frequency (%) | |
| True | 286 | 51.5% | |
| False | 168 | 30.3% | |
| (Missing) | 101 | 18.2% |
fl_passivel_iss
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) | |
| False | 442 | 79.6% | |
| True | 113 | 20.4% |
fl_rm
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| NAO | |
|---|---|
| SIM |
| Value | Count | Frequency (%) | |
| NAO | 311 | 56.0% | |
| SIM | 244 | 44.0% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
fl_sa
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 9 |
| Value | Count | Frequency (%) | |
| False | 546 | 98.4% | |
| True | 9 | 1.6% |
fl_simples_irregular
Constant
This variable is constant and should be ignored for analysis
| Constant value | False |
|---|
fl_spa
Constant
This variable is constant and should be ignored for analysis
| Constant value | False |
|---|
fl_st_especial
Constant
This variable is constant and should be ignored for analysis
| Constant value | False |
|---|
fl_telefone
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) | |
| True | 400 | 72.1% | |
| False | 155 | 27.9% |
fl_veiculo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| False | |
|---|---|
| True | 38 |
| Value | Count | Frequency (%) | |
| False | 517 | 93.2% | |
| True | 38 | 6.8% |
grau_instrucao_macro_analfabeto
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 99.8% |
| Missing (n) | 554 |
| 1 | 1 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 1 | 0.2% | |
| (Missing) | 554 | 99.8% |
grau_instrucao_macro_desconhecido
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
grau_instrucao_macro_escolaridade_fundamental
Numeric
| Distinct count | 8 |
|---|---|
| Unique (%) | 1.4% |
| Missing (%) | 95.5% |
| Missing (n) | 530 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 3.56 |
|---|---|
| Minimum | 1 |
| Maximum | 25 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 3 |
| 95-th percentile | 13.8 |
| Maximum | 25 |
| Range | 24 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 5.60565191 |
|---|---|
| Coef of variation | 1.574621323 |
| Kurtosis | 9.144672007 |
| Mean | 3.56 |
| MAD | 3.3408 |
| Skewness | 2.982660228 |
| Sum | 89 |
| Variance | 31.42333333 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 13 | 2.3% | |
| 3 | 4 | 0.7% | |
| 2 | 4 | 0.7% | |
| 4 | 1 | 0.2% | |
| 14 | 1 | 0.2% | |
| 25 | 1 | 0.2% | |
| 13 | 1 | 0.2% | |
| (Missing) | 530 | 95.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 13 | 2.3% | |
| 2 | 4 | 0.7% | |
| 3 | 4 | 0.7% | |
| 4 | 1 | 0.2% | |
| 13 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 25 | 1 | 0.2% | |
| 14 | 1 | 0.2% | |
| 13 | 1 | 0.2% | |
| 4 | 1 | 0.2% | |
| 3 | 4 | 0.7% |
grau_instrucao_macro_escolaridade_media
Numeric
| Distinct count | 18 |
|---|---|
| Unique (%) | 3.2% |
| Missing (%) | 84.5% |
| Missing (n) | 469 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 5.337209302 |
|---|---|
| Minimum | 1 |
| Maximum | 70 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 3 |
| Q3 | 6 |
| 95-th percentile | 13.75 |
| Maximum | 70 |
| Range | 69 |
| Interquartile range | 5 |
Descriptive statistics
| Standard deviation | 9.030159226 |
|---|---|
| Coef of variation | 1.691925258 |
| Kurtosis | 32.58669716 |
| Mean | 5.337209302 |
| MAD | 4.664413196 |
| Skewness | 5.124474064 |
| Sum | 459 |
| Variance | 81.54377565 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 33 | 5.9% | |
| 3 | 9 | 1.6% | |
| 2 | 8 | 1.4% | |
| 6 | 8 | 1.4% | |
| 4 | 6 | 1.1% | |
| 5 | 5 | 0.9% | |
| 11 | 3 | 0.5% | |
| 9 | 3 | 0.5% | |
| 7 | 2 | 0.4% | |
| 13 | 2 | 0.4% | |
| Other values (7) | 7 | 1.3% | |
| (Missing) | 469 | 84.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 33 | 5.9% | |
| 2 | 8 | 1.4% | |
| 3 | 9 | 1.6% | |
| 4 | 6 | 1.1% | |
| 5 | 5 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 70 | 1 | 0.2% | |
| 38 | 1 | 0.2% | |
| 24 | 1 | 0.2% | |
| 18 | 1 | 0.2% | |
| 14 | 1 | 0.2% |
grau_instrucao_macro_escolaridade_superior
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 95.9% |
| Missing (n) | 532 |
| 1 | 17 |
|---|---|
| 2 | 4 |
| 4 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 17 | 3.1% | |
| 2 | 4 | 0.7% | |
| 4 | 1 | 0.2% | |
| 3 | 1 | 0.2% | |
| (Missing) | 532 | 95.9% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
id
Categorical, Unique
| First 5 values |
|---|
| 00b5cef5652ca3e33f15cbab7249b9d1dc06c82df26d76... |
| 00b79c8f0f8a5ac93fa441f42b1619846dd2aaa5ddf2b3... |
| 013ce96aef919fb847b8f0cfc882a1ebad64e291e78ff6... |
| 01f73b5d6d29e955d5f8862698a291a961a3fde5848de6... |
| 02183ea423d627596a8a243492f07e048d3cb59ea1114d... |
| Last 5 values |
|---|
| feec91169df886e0b49f9625deb4fbb8614e0002788d8f... |
| ff4ad7a26c1ef7ec42bf5ec66888c43203f44bac4e9d53... |
| ff633fd9711d44def1878bf872343c4c32ca97a3e2c290... |
| ff94dca46aa53e42898307c53a702f357bb19cedb4b662... |
| ffd7c9bf8cb8089dffcceb22e5d84dc2814c5658f27e04... |
First 5 values
| Value | Count | Frequency (%) | |
| 00b5cef5652ca3e33f15cbab7249b9d1dc06c82df26d76d94f94834d2aba3b29 | 1 | 0.2% | |
| 00b79c8f0f8a5ac93fa441f42b1619846dd2aaa5ddf2b3c3c79c599e1202a16d | 1 | 0.2% | |
| 013ce96aef919fb847b8f0cfc882a1ebad64e291e78ff61a1988ac3d30c0fe39 | 1 | 0.2% | |
| 01f73b5d6d29e955d5f8862698a291a961a3fde5848de6018e6e251a9bd7aeeb | 1 | 0.2% | |
| 02183ea423d627596a8a243492f07e048d3cb59ea1114d80704dac0d1d35797d | 1 | 0.2% |
Last 5 values
| Value | Count | Frequency (%) | |
| ffd7c9bf8cb8089dffcceb22e5d84dc2814c5658f27e04cc8b9c69f56f2e94fd | 1 | 0.2% | |
| ff94dca46aa53e42898307c53a702f357bb19cedb4b662aff5e1678f015d6637 | 1 | 0.2% | |
| ff633fd9711d44def1878bf872343c4c32ca97a3e2c290ab7214d5e6a99cab26 | 1 | 0.2% | |
| ff4ad7a26c1ef7ec42bf5ec66888c43203f44bac4e9d532c43affbd9273a724e | 1 | 0.2% | |
| feec91169df886e0b49f9625deb4fbb8614e0002788d8f88fbdbc3f80d464b28 | 1 | 0.2% |
idade_acima_de_58
Highly correlated
This variable is highly correlated with grau_instrucao_macro_escolaridade_fundamental and should be ignored for analysis
| Correlation | 0.9173649446 |
|---|
idade_ate_18
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 99.3% |
| Missing (n) | 551 |
| 1 | 4 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 4 | 0.7% | |
| (Missing) | 551 | 99.3% |
idade_de_19_a_23
Highly correlated
This variable is highly correlated with idade_acima_de_58 and should be ignored for analysis
| Correlation | 0.9432422183 |
|---|
idade_de_24_a_28
Numeric
| Distinct count | 9 |
|---|---|
| Unique (%) | 1.6% |
| Missing (%) | 91.7% |
| Missing (n) | 509 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.652173913 |
|---|---|
| Minimum | 1 |
| Maximum | 11 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 3 |
| 95-th percentile | 8.75 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 2.442379453 |
|---|---|
| Coef of variation | 0.9208971707 |
| Kurtosis | 3.362463233 |
| Mean | 2.652173913 |
| MAD | 1.792060491 |
| Skewness | 1.92634963 |
| Sum | 122 |
| Variance | 5.965217391 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 21 | 3.8% | |
| 2 | 10 | 1.8% | |
| 5 | 4 | 0.7% | |
| 3 | 4 | 0.7% | |
| 4 | 3 | 0.5% | |
| 9 | 2 | 0.4% | |
| 8 | 1 | 0.2% | |
| 11 | 1 | 0.2% | |
| (Missing) | 509 | 91.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 21 | 3.8% | |
| 2 | 10 | 1.8% | |
| 3 | 4 | 0.7% | |
| 4 | 3 | 0.5% | |
| 5 | 4 | 0.7% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 11 | 1 | 0.2% | |
| 9 | 2 | 0.4% | |
| 8 | 1 | 0.2% | |
| 5 | 4 | 0.7% | |
| 4 | 3 | 0.5% |
idade_de_29_a_33
Highly correlated
This variable is highly correlated with grau_instrucao_macro_escolaridade_media and should be ignored for analysis
| Correlation | 0.9270644425 |
|---|
idade_de_34_a_38
Numeric
| Distinct count | 8 |
|---|---|
| Unique (%) | 1.4% |
| Missing (%) | 92.4% |
| Missing (n) | 513 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.238095238 |
|---|---|
| Minimum | 1 |
| Maximum | 12 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 5.95 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 2.535691793 |
|---|---|
| Coef of variation | 1.132968673 |
| Kurtosis | 9.435850017 |
| Mean | 2.238095238 |
| MAD | 1.56462585 |
| Skewness | 3.00825859 |
| Sum | 94 |
| Variance | 6.429732869 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 25 | 4.5% | |
| 2 | 8 | 1.4% | |
| 3 | 3 | 0.5% | |
| 5 | 2 | 0.4% | |
| 12 | 2 | 0.4% | |
| 4 | 1 | 0.2% | |
| 6 | 1 | 0.2% | |
| (Missing) | 513 | 92.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 25 | 4.5% | |
| 2 | 8 | 1.4% | |
| 3 | 3 | 0.5% | |
| 4 | 1 | 0.2% | |
| 5 | 2 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 12 | 2 | 0.4% | |
| 6 | 1 | 0.2% | |
| 5 | 2 | 0.4% | |
| 4 | 1 | 0.2% | |
| 3 | 3 | 0.5% |
idade_de_39_a_43
Numeric
| Distinct count | 9 |
|---|---|
| Unique (%) | 1.6% |
| Missing (%) | 94.2% |
| Missing (n) | 523 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.28125 |
|---|---|
| Minimum | 1 |
| Maximum | 11 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 8.9 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 2.726623518 |
|---|---|
| Coef of variation | 1.195232227 |
| Kurtosis | 4.276323234 |
| Mean | 2.28125 |
| MAD | 1.89453125 |
| Skewness | 2.270870962 |
| Sum | 73 |
| Variance | 7.434475806 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 23 | 4.1% | |
| 2 | 3 | 0.5% | |
| 6 | 1 | 0.2% | |
| 5 | 1 | 0.2% | |
| 11 | 1 | 0.2% | |
| 8 | 1 | 0.2% | |
| 10 | 1 | 0.2% | |
| 4 | 1 | 0.2% | |
| (Missing) | 523 | 94.2% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 23 | 4.1% | |
| 2 | 3 | 0.5% | |
| 4 | 1 | 0.2% | |
| 5 | 1 | 0.2% | |
| 6 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 11 | 1 | 0.2% | |
| 10 | 1 | 0.2% | |
| 8 | 1 | 0.2% | |
| 6 | 1 | 0.2% | |
| 5 | 1 | 0.2% |
idade_de_44_a_48
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.9% |
| Missing (%) | 96.2% |
| Missing (n) | 534 |
| 1 | 16 |
|---|---|
| 2 | 3 |
| 5 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 16 | 2.9% | |
| 2 | 3 | 0.5% | |
| 5 | 1 | 0.2% | |
| 3 | 1 | 0.2% | |
| (Missing) | 534 | 96.2% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
idade_de_49_a_53
Highly correlated
This variable is highly correlated with idade_de_29_a_33 and should be ignored for analysis
| Correlation | 0.9490484453 |
|---|
idade_de_54_a_58
Highly correlated
This variable is highly correlated with idade_de_34_a_38 and should be ignored for analysis
| Correlation | 0.9779514754 |
|---|
idade_emp_cat
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 1.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| 1 a 5 | |
|---|---|
| 5 a 10 | |
| > 20 | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| 1 a 5 | 157 | 28.3% | |
| 5 a 10 | 144 | 25.9% | |
| > 20 | 92 | 16.6% | |
| <= 1 | 56 | 10.1% | |
| 10 a 15 | 55 | 9.9% | |
| 15 a 20 | 51 | 9.2% |
| Max length | 7 |
|---|---|
| Mean length | 5.374774775 |
| Min length | 4 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | True |
| Contains non-words | True |
idade_empresa_anos
Numeric
| Distinct count | 522 |
|---|---|
| Unique (%) | 94.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 10.13924719 |
|---|---|
| Minimum | 0.05205479452 |
| Maximum | 47.51780822 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 0.05205479452 |
|---|---|
| 5-th percentile | 0.5575342466 |
| Q1 | 2.924657534 |
| Median | 6.709589041 |
| Q3 | 15.49452055 |
| 95-th percentile | 30.3 |
| Maximum | 47.51780822 |
| Range | 47.46575342 |
| Interquartile range | 12.56986301 |
Descriptive statistics
| Standard deviation | 9.641625411 |
|---|---|
| Coef of variation | 0.95092123 |
| Kurtosis | 1.083164559 |
| Mean | 10.13924719 |
| MAD | 7.67673115 |
| Skewness | 1.280516564 |
| Sum | 5627.282192 |
| Variance | 92.96094057 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 8.010958904 | 4 | 0.7% | |
| 1.682191781 | 3 | 0.5% | |
| 1.583561644 | 3 | 0.5% | |
| 9.750684932 | 2 | 0.4% | |
| 1.15890411 | 2 | 0.4% | |
| 1.468493151 | 2 | 0.4% | |
| 6.416438356 | 2 | 0.4% | |
| 3.287671233 | 2 | 0.4% | |
| 1.597260274 | 2 | 0.4% | |
| 11.11232877 | 2 | 0.4% | |
| Other values (512) | 531 | 95.7% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0.05205479452 | 2 | 0.4% | |
| 0.1095890411 | 1 | 0.2% | |
| 0.1205479452 | 1 | 0.2% | |
| 0.1506849315 | 1 | 0.2% | |
| 0.1643835616 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 47.51780822 | 1 | 0.2% | |
| 46.76164384 | 1 | 0.2% | |
| 46.11506849 | 1 | 0.2% | |
| 43.18630137 | 1 | 0.2% | |
| 42.7260274 | 1 | 0.2% |
idade_maxima_coligadas
Highly correlated
This variable is highly correlated with coligada_mais_antiga_ativa and should be ignored for analysis
| Correlation | 1 |
|---|
idade_maxima_socios
Numeric
| Distinct count | 63 |
|---|---|
| Unique (%) | 11.4% |
| Missing (%) | 34.8% |
| Missing (n) | 193 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 42.87292818 |
|---|---|
| Minimum | 9 |
| Maximum | 83 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 32 |
| Median | 41 |
| Q3 | 51 |
| 95-th percentile | 67.95 |
| Maximum | 83 |
| Range | 74 |
| Interquartile range | 19 |
Descriptive statistics
| Standard deviation | 13.48664618 |
|---|---|
| Coef of variation | 0.3145725462 |
| Kurtosis | -0.1483653675 |
| Mean | 42.87292818 |
| MAD | 10.89212783 |
| Skewness | 0.4970038169 |
| Sum | 15520 |
| Variance | 181.8896252 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 29 | 18 | 3.2% | |
| 37 | 13 | 2.3% | |
| 41 | 12 | 2.2% | |
| 34 | 12 | 2.2% | |
| 50 | 12 | 2.2% | |
| 40 | 11 | 2.0% | |
| 38 | 11 | 2.0% | |
| 42 | 11 | 2.0% | |
| 44 | 11 | 2.0% | |
| 49 | 10 | 1.8% | |
| Other values (52) | 241 | 43.4% | |
| (Missing) | 193 | 34.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 9 | 1 | 0.2% | |
| 18 | 2 | 0.4% | |
| 20 | 4 | 0.7% | |
| 21 | 2 | 0.4% | |
| 23 | 4 | 0.7% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 83 | 1 | 0.2% | |
| 82 | 1 | 0.2% | |
| 79 | 1 | 0.2% | |
| 78 | 1 | 0.2% | |
| 76 | 2 | 0.4% |
idade_media_coligadas
Numeric
| Distinct count | 63 |
|---|---|
| Unique (%) | 11.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 162.9516377 |
|---|---|
| Minimum | 4.066666667 |
| Maximum | 484.3666667 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 4.066666667 |
|---|---|
| 5-th percentile | 17.64666667 |
| Q1 | 81.2 |
| Median | 135.4666667 |
| Q3 | 225.9833333 |
| 95-th percentile | 373.8509524 |
| Maximum | 484.3666667 |
| Range | 480.3 |
| Interquartile range | 144.7833333 |
Descriptive statistics
| Standard deviation | 118.4303349 |
|---|---|
| Coef of variation | 0.7267821088 |
| Kurtosis | 0.2608933621 |
| Mean | 162.9516377 |
| MAD | 96.12434557 |
| Skewness | 0.8460548977 |
| Sum | 10265.95318 |
| Variance | 14025.74422 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 27.73333333 | 2 | 0.4% | |
| 312.1 | 1 | 0.2% | |
| 86.5 | 1 | 0.2% | |
| 244.1979798 | 1 | 0.2% | |
| 93.73333333 | 1 | 0.2% | |
| 135.4666667 | 1 | 0.2% | |
| 476.0666667 | 1 | 0.2% | |
| 172.2 | 1 | 0.2% | |
| 194.3 | 1 | 0.2% | |
| 224.95 | 1 | 0.2% | |
| Other values (52) | 52 | 9.4% | |
| (Missing) | 492 | 88.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 4.066666667 | 1 | 0.2% | |
| 9.066666667 | 1 | 0.2% | |
| 10.16666667 | 1 | 0.2% | |
| 17.43333333 | 1 | 0.2% | |
| 19.56666667 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 484.3666667 | 1 | 0.2% | |
| 476.0666667 | 1 | 0.2% | |
| 426.4333333 | 1 | 0.2% | |
| 374.4714286 | 1 | 0.2% | |
| 368.2666667 | 1 | 0.2% |
idade_media_coligadas_ativas
Highly correlated
This variable is highly correlated with idade_media_coligadas and should be ignored for analysis
| Correlation | 1 |
|---|
idade_media_coligadas_baixadas
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
idade_media_socios
Highly correlated
This variable is highly correlated with idade_maxima_socios and should be ignored for analysis
| Correlation | 0.9663386807 |
|---|
idade_minima_coligadas
Highly correlated
This variable is highly correlated with coligada_mais_nova_ativa and should be ignored for analysis
| Correlation | 1 |
|---|
idade_minima_socios
Highly correlated
This variable is highly correlated with idade_media_socios and should be ignored for analysis
| Correlation | 0.9672309258 |
|---|
max_faturamento_est_coligados
Highly correlated
This variable is highly correlated with faturamento_est_coligados_gp and should be ignored for analysis
| Correlation | 0.9753288607 |
|---|
max_faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with max_faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.9208848359 |
|---|
max_filiais_coligados
Numeric
| Distinct count | 10 |
|---|---|
| Unique (%) | 1.8% |
| Missing (%) | 97.3% |
| Missing (n) | 540 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 9.2 |
|---|---|
| Minimum | 1 |
| Maximum | 46 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 3 |
| Q3 | 10 |
| 95-th percentile | 36.2 |
| Maximum | 46 |
| Range | 45 |
| Interquartile range | 9 |
Descriptive statistics
| Standard deviation | 13.27833897 |
|---|---|
| Coef of variation | 1.443297714 |
| Kurtosis | 3.786404955 |
| Mean | 9.2 |
| MAD | 9.626666667 |
| Skewness | 2.058919419 |
| Sum | 138 |
| Variance | 176.3142857 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 5 | 0.9% | |
| 5 | 2 | 0.4% | |
| 3 | 2 | 0.4% | |
| 6 | 1 | 0.2% | |
| 46 | 1 | 0.2% | |
| 2 | 1 | 0.2% | |
| 32 | 1 | 0.2% | |
| 14 | 1 | 0.2% | |
| 17 | 1 | 0.2% | |
| (Missing) | 540 | 97.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 5 | 0.9% | |
| 2 | 1 | 0.2% | |
| 3 | 2 | 0.4% | |
| 5 | 2 | 0.4% | |
| 6 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 46 | 1 | 0.2% | |
| 32 | 1 | 0.2% | |
| 17 | 1 | 0.2% | |
| 14 | 1 | 0.2% | |
| 6 | 1 | 0.2% |
max_funcionarios_coligados_gp
Highly correlated
This variable is highly correlated with max_faturamento_est_coligados_gp and should be ignored for analysis
| Correlation | 0.9988162351 |
|---|
max_meses_servicos
Numeric
| Distinct count | 87 |
|---|---|
| Unique (%) | 15.7% |
| Missing (%) | 81.8% |
| Missing (n) | 454 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 72.98976898 |
|---|---|
| Minimum | 5.8 |
| Maximum | 309.3333333 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 5.8 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 25.23333333 |
| Median | 54.7 |
| Q3 | 93.26666667 |
| 95-th percentile | 198.7666667 |
| Maximum | 309.3333333 |
| Range | 303.5333333 |
| Interquartile range | 68.03333333 |
Descriptive statistics
| Standard deviation | 61.44349522 |
|---|---|
| Coef of variation | 0.8418096959 |
| Kurtosis | 2.367120623 |
| Mean | 72.98976898 |
| MAD | 46.91584485 |
| Skewness | 1.488274524 |
| Sum | 7371.966667 |
| Variance | 3775.303105 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 89.2 | 4 | 0.7% | |
| 21.13333333 | 3 | 0.5% | |
| 25.23333333 | 2 | 0.4% | |
| 93.26666667 | 2 | 0.4% | |
| 23.2 | 2 | 0.4% | |
| 18.13333333 | 2 | 0.4% | |
| 17.13333333 | 2 | 0.4% | |
| 49.6 | 2 | 0.4% | |
| 81.06666667 | 2 | 0.4% | |
| 43.53333333 | 2 | 0.4% | |
| Other values (76) | 78 | 14.1% | |
| (Missing) | 454 | 81.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 5.8 | 1 | 0.2% | |
| 5.966666667 | 1 | 0.2% | |
| 6.966666667 | 1 | 0.2% | |
| 9.366666667 | 1 | 0.2% | |
| 9.9 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 309.3333333 | 1 | 0.2% | |
| 253.5 | 1 | 0.2% | |
| 252.4333333 | 1 | 0.2% | |
| 234.2666667 | 1 | 0.2% | |
| 200.7666667 | 1 | 0.2% |
max_meses_servicos_all
Numeric
| Distinct count | 120 |
|---|---|
| Unique (%) | 21.6% |
| Missing (%) | 76.4% |
| Missing (n) | 424 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 103.3498728 |
|---|---|
| Minimum | 1.033333333 |
| Maximum | 5014.966667 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1.033333333 |
|---|---|
| 5-th percentile | 5.883333333 |
| Q1 | 23.23333333 |
| Median | 46.66666667 |
| Q3 | 89.2 |
| 95-th percentile | 196.2166667 |
| Maximum | 5014.966667 |
| Range | 5013.933333 |
| Interquartile range | 65.96666667 |
Descriptive statistics
| Standard deviation | 436.3619417 |
|---|---|
| Coef of variation | 4.222181702 |
| Kurtosis | 126.2351103 |
| Mean | 103.3498728 |
| MAD | 98.03210769 |
| Skewness | 11.13790553 |
| Sum | 13538.83333 |
| Variance | 190411.7442 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 89.2 | 4 | 0.7% | |
| 49.6 | 2 | 0.4% | |
| 23.2 | 2 | 0.4% | |
| 52.76666667 | 2 | 0.4% | |
| 81.06666667 | 2 | 0.4% | |
| 43.53333333 | 2 | 0.4% | |
| 17.13333333 | 2 | 0.4% | |
| 93.26666667 | 2 | 0.4% | |
| 28.43333333 | 2 | 0.4% | |
| 95.23333333 | 2 | 0.4% | |
| Other values (109) | 109 | 19.6% | |
| (Missing) | 424 | 76.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1.033333333 | 1 | 0.2% | |
| 2.033333333 | 1 | 0.2% | |
| 3.433333333 | 1 | 0.2% | |
| 4 | 1 | 0.2% | |
| 4.6 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 5014.966667 | 1 | 0.2% | |
| 309.3333333 | 1 | 0.2% | |
| 252.4333333 | 1 | 0.2% | |
| 234.2666667 | 1 | 0.2% | |
| 213.4 | 1 | 0.2% |
max_vl_folha_coligados
Highly correlated
This variable is highly correlated with max_faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.9432797848 |
|---|
max_vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with max_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.9830355726 |
|---|
media_faturamento_est_coligados
Highly correlated
This variable is highly correlated with max_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.9460011571 |
|---|
media_faturamento_est_coligados_gp
Numeric
| Distinct count | 40 |
|---|---|
| Unique (%) | 7.2% |
| Missing (%) | 88.8% |
| Missing (n) | 493 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 6347711.473 |
|---|---|
| Minimum | 61819.19922 |
| Maximum | 240723968 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 61819.19922 |
|---|---|
| 5-th percentile | 210000 |
| Q1 | 210000 |
| Median | 435000 |
| Q3 | 1067895.818 |
| 95-th percentile | 9344487.25 |
| Maximum | 240723968 |
| Range | 240662148.8 |
| Interquartile range | 857895.8177 |
Descriptive statistics
| Standard deviation | 31643431.39 |
|---|---|
| Coef of variation | 4.98501413 |
| Kurtosis | 51.56392603 |
| Mean | 6347711.473 |
| MAD | 10327019.31 |
| Skewness | 7.011777806 |
| Sum | 393558111.3 |
| Variance | 1.00130675e+15 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 210000 | 19 | 3.4% | |
| 370915.1875 | 3 | 0.5% | |
| 228638.3984 | 2 | 0.4% | |
| 930000 | 2 | 0.4% | |
| 556372.8125 | 2 | 0.4% | |
| 61819.19922 | 1 | 0.2% | |
| 1470000 | 1 | 0.2% | |
| 26555480.93 | 1 | 0.2% | |
| 420000 | 1 | 0.2% | |
| 5811005 | 1 | 0.2% | |
| Other values (29) | 29 | 5.2% | |
| (Missing) | 493 | 88.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 61819.19922 | 1 | 0.2% | |
| 185457.5938 | 1 | 0.2% | |
| 206064 | 1 | 0.2% | |
| 210000 | 19 | 3.4% | |
| 228638.3984 | 2 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 240723968 | 1 | 0.2% | |
| 68718535.64 | 1 | 0.2% | |
| 26555480.93 | 1 | 0.2% | |
| 9530460 | 1 | 0.2% | |
| 5811005 | 1 | 0.2% |
media_filiais_coligados
Highly correlated
This variable is highly correlated with max_filiais_coligados and should be ignored for analysis
| Correlation | 0.984817247 |
|---|
media_funcionarios_coligados_gp
Highly correlated
This variable is highly correlated with media_filiais_coligados and should be ignored for analysis
| Correlation | 0.9290150933 |
|---|
media_meses_servicos
Numeric
| Distinct count | 97 |
|---|---|
| Unique (%) | 17.5% |
| Missing (%) | 81.8% |
| Missing (n) | 454 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 49.92900583 |
|---|---|
| Minimum | 5.406666667 |
| Maximum | 228.1555556 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 5.406666667 |
|---|---|
| 5-th percentile | 6.416666667 |
| Q1 | 22.32307692 |
| Median | 41.96666667 |
| Q3 | 65.91666667 |
| 95-th percentile | 131.6916667 |
| Maximum | 228.1555556 |
| Range | 222.7488889 |
| Interquartile range | 43.59358974 |
Descriptive statistics
| Standard deviation | 40.96894942 |
|---|---|
| Coef of variation | 0.8205440653 |
| Kurtosis | 4.839175601 |
| Mean | 49.92900583 |
| MAD | 28.94381254 |
| Skewness | 1.924815809 |
| Sum | 5042.829589 |
| Variance | 1678.454816 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 18.13333333 | 2 | 0.4% | |
| 23.2 | 2 | 0.4% | |
| 66.83333333 | 2 | 0.4% | |
| 23.53333333 | 2 | 0.4% | |
| 49.6 | 2 | 0.4% | |
| 12.74444444 | 1 | 0.2% | |
| 71.13333333 | 1 | 0.2% | |
| 14.4525641 | 1 | 0.2% | |
| 11.53333333 | 1 | 0.2% | |
| 22.1 | 1 | 0.2% | |
| Other values (86) | 86 | 15.5% | |
| (Missing) | 454 | 81.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 5.406666667 | 1 | 0.2% | |
| 5.716666667 | 1 | 0.2% | |
| 5.955555556 | 1 | 0.2% | |
| 6.15 | 1 | 0.2% | |
| 6.333333333 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 228.1555556 | 1 | 0.2% | |
| 199.7666667 | 1 | 0.2% | |
| 176.9333333 | 1 | 0.2% | |
| 148 | 1 | 0.2% | |
| 142.9 | 1 | 0.2% |
media_meses_servicos_all
Highly correlated
This variable is highly correlated with max_meses_servicos_all and should be ignored for analysis
| Correlation | 0.9651590156 |
|---|
media_vl_folha_coligados
Highly correlated
This variable is highly correlated with media_faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.9023282866 |
|---|
media_vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with media_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.9956423815 |
|---|
meses_ultima_contratacaco
Numeric
| Distinct count | 103 |
|---|---|
| Unique (%) | 18.6% |
| Missing (%) | 76.4% |
| Missing (n) | 424 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 35.05928753 |
|---|---|
| Minimum | 2.166666667 |
| Maximum | 153.1 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 2.166666667 |
|---|---|
| 5-th percentile | 2.933333333 |
| Q1 | 8.033333333 |
| Median | 23.2 |
| Q3 | 54.66666667 |
| 95-th percentile | 92.75 |
| Maximum | 153.1 |
| Range | 150.9333333 |
| Interquartile range | 46.63333333 |
Descriptive statistics
| Standard deviation | 31.5211692 |
|---|---|
| Coef of variation | 0.8990818529 |
| Kurtosis | 1.30509277 |
| Mean | 35.05928753 |
| MAD | 25.68343725 |
| Skewness | 1.182480184 |
| Sum | 4592.766667 |
| Variance | 993.5841075 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 2.933333333 | 6 | 1.1% | |
| 6.966666667 | 4 | 0.7% | |
| 21.13333333 | 3 | 0.5% | |
| 22.13333333 | 3 | 0.5% | |
| 25.23333333 | 3 | 0.5% | |
| 23.2 | 3 | 0.5% | |
| 4.966666667 | 3 | 0.5% | |
| 5.966666667 | 2 | 0.4% | |
| 58.66666667 | 2 | 0.4% | |
| 18.13333333 | 2 | 0.4% | |
| Other values (92) | 100 | 18.0% | |
| (Missing) | 424 | 76.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 2.166666667 | 1 | 0.2% | |
| 2.2 | 1 | 0.2% | |
| 2.233333333 | 1 | 0.2% | |
| 2.533333333 | 1 | 0.2% | |
| 2.933333333 | 6 | 1.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 153.1 | 1 | 0.2% | |
| 142.9 | 1 | 0.2% | |
| 112.3333333 | 1 | 0.2% | |
| 104.0333333 | 1 | 0.2% | |
| 102.3666667 | 1 | 0.2% |
min_faturamento_est_coligados
Highly correlated
This variable is highly correlated with media_faturamento_est_coligados_gp and should be ignored for analysis
| Correlation | 0.9401087019 |
|---|
min_faturamento_est_coligados_gp
Highly correlated
This variable is highly correlated with min_faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.9857654776 |
|---|
min_filiais_coligados
Highly correlated
This variable is highly correlated with media_vl_folha_coligados_gp and should be ignored for analysis
| Correlation | 0.9113088103 |
|---|
min_funcionarios_coligados_gp
Highly correlated
This variable is highly correlated with min_faturamento_est_coligados_gp and should be ignored for analysis
| Correlation | 0.9974669262 |
|---|
min_meses_servicos
Numeric
| Distinct count | 79 |
|---|---|
| Unique (%) | 14.2% |
| Missing (%) | 81.8% |
| Missing (n) | 454 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 33.61551155 |
|---|---|
| Minimum | 2.166666667 |
| Maximum | 192.6333333 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 2.166666667 |
|---|---|
| 5-th percentile | 2.933333333 |
| Q1 | 5.966666667 |
| Median | 22.13333333 |
| Q3 | 48.56666667 |
| 95-th percentile | 104.0333333 |
| Maximum | 192.6333333 |
| Range | 190.4666667 |
| Interquartile range | 42.6 |
Descriptive statistics
| Standard deviation | 37.4105082 |
|---|---|
| Coef of variation | 1.112894211 |
| Kurtosis | 4.072852971 |
| Mean | 33.61551155 |
| MAD | 27.72425579 |
| Skewness | 1.910373555 |
| Sum | 3395.166667 |
| Variance | 1399.546124 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 2.933333333 | 6 | 1.1% | |
| 23.2 | 4 | 0.7% | |
| 6.966666667 | 4 | 0.7% | |
| 4.966666667 | 3 | 0.5% | |
| 22.13333333 | 2 | 0.4% | |
| 49.6 | 2 | 0.4% | |
| 25.23333333 | 2 | 0.4% | |
| 38.43333333 | 2 | 0.4% | |
| 3.833333333 | 2 | 0.4% | |
| 61.76666667 | 2 | 0.4% | |
| Other values (68) | 72 | 13.0% | |
| (Missing) | 454 | 81.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 2.166666667 | 1 | 0.2% | |
| 2.2 | 1 | 0.2% | |
| 2.233333333 | 1 | 0.2% | |
| 2.533333333 | 1 | 0.2% | |
| 2.933333333 | 6 | 1.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 192.6333333 | 1 | 0.2% | |
| 153.1 | 1 | 0.2% | |
| 148 | 1 | 0.2% | |
| 142.9 | 1 | 0.2% | |
| 117.5666667 | 1 | 0.2% |
min_meses_servicos_all
Numeric
| Distinct count | 101 |
|---|---|
| Unique (%) | 18.2% |
| Missing (%) | 76.4% |
| Missing (n) | 424 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 13.21882952 |
|---|---|
| Minimum | -0.7 |
| Maximum | 142.9 |
| Zeros (%) | 0.9% |
Quantile statistics
| Minimum | -0.7 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.033333333 |
| Median | 5.033333333 |
| Q3 | 17.28333333 |
| 95-th percentile | 53.36666667 |
| Maximum | 142.9 |
| Range | 143.6 |
| Interquartile range | 16.25 |
Descriptive statistics
| Standard deviation | 20.45002752 |
|---|---|
| Coef of variation | 1.547037693 |
| Kurtosis | 14.69860143 |
| Mean | 13.21882952 |
| MAD | 13.28032554 |
| Skewness | 3.282660463 |
| Sum | 1731.666667 |
| Variance | 418.2036256 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 8 | 1.4% | |
| 0 | 5 | 0.9% | |
| 3.066666667 | 4 | 0.7% | |
| 1.033333333 | 3 | 0.5% | |
| 2.033333333 | 3 | 0.5% | |
| 6.966666667 | 2 | 0.4% | |
| 6.033333333 | 2 | 0.4% | |
| 0.7 | 2 | 0.4% | |
| 2.933333333 | 2 | 0.4% | |
| 8.133333333 | 2 | 0.4% | |
| Other values (90) | 98 | 17.7% | |
| (Missing) | 424 | 76.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -0.7 | 1 | 0.2% | |
| -0.4666666667 | 1 | 0.2% | |
| -0.2333333333 | 1 | 0.2% | |
| -0.06666666667 | 2 | 0.4% | |
| 0 | 5 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 142.9 | 1 | 0.2% | |
| 104.0333333 | 1 | 0.2% | |
| 69 | 1 | 0.2% | |
| 64.83333333 | 1 | 0.2% | |
| 54.73333333 | 1 | 0.2% |
min_vl_folha_coligados
Highly correlated
This variable is highly correlated with min_faturamento_est_coligados and should be ignored for analysis
| Correlation | 0.9115581462 |
|---|
min_vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with min_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.9997894656 |
|---|
natureza_juridica_macro
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| OUTROS | |
|---|---|
| ENTIDADES EMPRESARIAIS |
| Value | Count | Frequency (%) | |
| OUTROS | 447 | 80.5% | |
| ENTIDADES EMPRESARIAIS | 108 | 19.5% |
| Max length | 22 |
|---|---|
| Mean length | 9.113513514 |
| Min length | 6 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_divisao
Categorical
| Distinct count | 8 |
|---|---|
| Unique (%) | 1.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| COMERCIO VAREJISTA | |
|---|---|
| FABRICACAO DE PRODUTOS ALIMENTICIOS | 16 |
| CONFECCAO DE ARTIGOS DO VESTUARIO E ACESSORIOS | 9 |
| Other values (5) | 34 |
| Value | Count | Frequency (%) | |
| COMERCIO VAREJISTA | 496 | 89.4% | |
| FABRICACAO DE PRODUTOS ALIMENTICIOS | 16 | 2.9% | |
| CONFECCAO DE ARTIGOS DO VESTUARIO E ACESSORIOS | 9 | 1.6% | |
| FABRICACAO DE MOVEIS | 9 | 1.6% | |
| FABRICACAO DE PRODUTOS DE MADEIRA | 9 | 1.6% | |
| FABRICACAO DE PRODUTOS DIVERSOS | 8 | 1.4% | |
| MANUTENCAO REPARACAO E INSTALACAO DE MAQUINAS E EQUIPAMENTOS | 7 | 1.3% | |
| FABRICACAO DE BEBIDAS | 1 | 0.2% |
| Max length | 60 |
|---|---|
| Mean length | 19.94234234 |
| Min length | 18 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_meso_regiao
Categorical
| Distinct count | 20 |
|---|---|
| Unique (%) | 3.6% |
| Missing (%) | 11.5% |
| Missing (n) | 64 |
| CENTRO AMAZONENSE | |
|---|---|
| LESTE POTIGUAR | |
| NORTE MARANHENSE | |
| Other values (16) | |
| (Missing) |
| Value | Count | Frequency (%) | |
| CENTRO AMAZONENSE | 78 | 14.1% | |
| LESTE POTIGUAR | 77 | 13.9% | |
| NORTE MARANHENSE | 65 | 11.7% | |
| CENTRO NORTE PIAUIENSE | 55 | 9.9% | |
| OESTE MARANHENSE | 36 | 6.5% | |
| LESTE MARANHENSE | 25 | 4.5% | |
| CENTRO MARANHENSE | 21 | 3.8% | |
| VALE DO ACRE | 20 | 3.6% | |
| OESTE POTIGUAR | 18 | 3.2% | |
| SUDESTE PIAUIENSE | 17 | 3.1% | |
| Other values (9) | 79 | 14.2% | |
| (Missing) | 64 | 11.5% |
| Max length | 22 |
|---|---|
| Mean length | 14.80540541 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_micro_regiao
Categorical
| Distinct count | 67 |
|---|---|
| Unique (%) | 12.1% |
| Missing (%) | 11.5% |
| Missing (n) | 64 |
| MANAUS | |
|---|---|
| NATAL | 58 |
| AGLOMERACAO URBANA DE SAO LUIS | 48 |
| Other values (63) | |
| (Missing) | 64 |
| Value | Count | Frequency (%) | |
| MANAUS | 65 | 11.7% | |
| NATAL | 58 | 10.5% | |
| AGLOMERACAO URBANA DE SAO LUIS | 48 | 8.6% | |
| TERESINA | 41 | 7.4% | |
| IMPERATRIZ | 20 | 3.6% | |
| RIO BRANCO | 15 | 2.7% | |
| PINDARE | 11 | 2.0% | |
| LITORAL SUL | 10 | 1.8% | |
| ALTO MEDIO CANINDE | 9 | 1.6% | |
| MEDIO MEARIM | 9 | 1.6% | |
| Other values (56) | 205 | 36.9% | |
| (Missing) | 64 | 11.5% |
| Max length | 33 |
|---|---|
| Mean length | 11.13153153 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nm_segmento
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| COMERCIO; REPARACAO DE VEICULOS AUTOMOTORES E MOTOCICLETAS | |
|---|---|
| INDUSTRIAS DE TRANSFORMACAO | 59 |
| Value | Count | Frequency (%) | |
| COMERCIO; REPARACAO DE VEICULOS AUTOMOTORES E MOTOCICLETAS | 496 | 89.4% | |
| INDUSTRIAS DE TRANSFORMACAO | 59 | 10.6% |
| Max length | 58 |
|---|---|
| Mean length | 54.7045045 |
| Min length | 27 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | True |
| Contains non-words | True |
nu_meses_rescencia
Numeric
| Distinct count | 24 |
|---|---|
| Unique (%) | 4.3% |
| Missing (%) | 8.5% |
| Missing (n) | 47 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 24.77165354 |
|---|---|
| Minimum | 7 |
| Maximum | 54 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 22 |
| Median | 23 |
| Q3 | 25 |
| 95-th percentile | 48 |
| Maximum | 54 |
| Range | 47 |
| Interquartile range | 3 |
Descriptive statistics
| Standard deviation | 9.969728522 |
|---|---|
| Coef of variation | 0.402465201 |
| Kurtosis | 1.800586709 |
| Mean | 24.77165354 |
| MAD | 6.057691115 |
| Skewness | 1.222048203 |
| Sum | 12584 |
| Variance | 99.39548681 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 23 | 124 | 22.3% | |
| 22 | 107 | 19.3% | |
| 24 | 51 | 9.2% | |
| 48 | 35 | 6.3% | |
| 25 | 35 | 6.3% | |
| 26 | 28 | 5.0% | |
| 21 | 22 | 4.0% | |
| 27 | 16 | 2.9% | |
| 9 | 13 | 2.3% | |
| 10 | 13 | 2.3% | |
| Other values (13) | 64 | 11.5% | |
| (Missing) | 47 | 8.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 7 | 10 | 1.8% | |
| 8 | 11 | 2.0% | |
| 9 | 13 | 2.3% | |
| 10 | 13 | 2.3% | |
| 11 | 5 | 0.9% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 54 | 3 | 0.5% | |
| 52 | 1 | 0.2% | |
| 50 | 11 | 2.0% | |
| 49 | 4 | 0.7% | |
| 48 | 35 | 6.3% |
percent_func_genero_fem
Numeric
| Distinct count | 31 |
|---|---|
| Unique (%) | 5.6% |
| Missing (%) | 81.8% |
| Missing (n) | 454 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 54.43534653 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros (%) | 4.0% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 19.23 |
| Median | 53.85 |
| Q3 | 100 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range | 80.77 |
Descriptive statistics
| Standard deviation | 38.79162661 |
|---|---|
| Coef of variation | 0.712618346 |
| Kurtosis | -1.444618072 |
| Mean | 54.43534653 |
| MAD | 33.76817175 |
| Skewness | -0.1589297694 |
| Sum | 5497.97 |
| Variance | 1504.790295 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 100 | 32 | 5.8% | |
| 0 | 22 | 4.0% | |
| 50 | 11 | 2.0% | |
| 66.67 | 5 | 0.9% | |
| 33.33 | 4 | 0.7% | |
| 60 | 2 | 0.4% | |
| 75 | 2 | 0.4% | |
| 25 | 1 | 0.2% | |
| 62.5 | 1 | 0.2% | |
| 37.5 | 1 | 0.2% | |
| Other values (20) | 20 | 3.6% | |
| (Missing) | 454 | 81.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 22 | 4.0% | |
| 4.76 | 1 | 0.2% | |
| 14.29 | 1 | 0.2% | |
| 15.71 | 1 | 0.2% | |
| 19.23 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 100 | 32 | 5.8% | |
| 85.71 | 1 | 0.2% | |
| 80 | 1 | 0.2% | |
| 77.78 | 1 | 0.2% | |
| 75 | 2 | 0.4% |
percent_func_genero_masc
Numeric
| Distinct count | 31 |
|---|---|
| Unique (%) | 5.6% |
| Missing (%) | 81.8% |
| Missing (n) | 454 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 45.56465347 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros (%) | 5.8% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 46.15 |
| Q3 | 80.77 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range | 80.77 |
Descriptive statistics
| Standard deviation | 38.79162661 |
|---|---|
| Coef of variation | 0.851353487 |
| Kurtosis | -1.444618072 |
| Mean | 45.56465347 |
| MAD | 33.76817175 |
| Skewness | 0.1589297694 |
| Sum | 4602.03 |
| Variance | 1504.790295 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 0 | 32 | 5.8% | |
| 100 | 22 | 4.0% | |
| 50 | 11 | 2.0% | |
| 33.33 | 5 | 0.9% | |
| 66.67 | 4 | 0.7% | |
| 40 | 2 | 0.4% | |
| 25 | 2 | 0.4% | |
| 14.29 | 1 | 0.2% | |
| 37.5 | 1 | 0.2% | |
| 62.5 | 1 | 0.2% | |
| Other values (20) | 20 | 3.6% | |
| (Missing) | 454 | 81.8% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 32 | 5.8% | |
| 14.29 | 1 | 0.2% | |
| 20 | 1 | 0.2% | |
| 22.22 | 1 | 0.2% | |
| 25 | 2 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 100 | 22 | 4.0% | |
| 95.24 | 1 | 0.2% | |
| 85.71 | 1 | 0.2% | |
| 84.29 | 1 | 0.2% | |
| 80.77 | 1 | 0.2% |
qt_admitidos
Numeric
| Distinct count | 41 |
|---|---|
| Unique (%) | 7.4% |
| Missing (%) | 76.4% |
| Missing (n) | 424 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 14.3740458 |
|---|---|
| Minimum | 1 |
| Maximum | 203 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1.5 |
| Median | 5 |
| Q3 | 13.5 |
| 95-th percentile | 49.5 |
| Maximum | 203 |
| Range | 202 |
| Interquartile range | 12 |
Descriptive statistics
| Standard deviation | 26.95590013 |
|---|---|
| Coef of variation | 1.875317534 |
| Kurtosis | 25.18314462 |
| Mean | 14.3740458 |
| MAD | 15.13023717 |
| Skewness | 4.481608458 |
| Sum | 1883 |
| Variance | 726.620552 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 33 | 5.9% | |
| 2 | 14 | 2.5% | |
| 4 | 12 | 2.2% | |
| 6 | 7 | 1.3% | |
| 3 | 5 | 0.9% | |
| 8 | 5 | 0.9% | |
| 13 | 4 | 0.7% | |
| 9 | 4 | 0.7% | |
| 7 | 4 | 0.7% | |
| 10 | 4 | 0.7% | |
| Other values (30) | 39 | 7.0% | |
| (Missing) | 424 | 76.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 33 | 5.9% | |
| 2 | 14 | 2.5% | |
| 3 | 5 | 0.9% | |
| 4 | 12 | 2.2% | |
| 5 | 2 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 203 | 1 | 0.2% | |
| 164 | 1 | 0.2% | |
| 100 | 1 | 0.2% | |
| 69 | 1 | 0.2% | |
| 65 | 1 | 0.2% |
qt_admitidos_12meses
Numeric
| Distinct count | 12 |
|---|---|
| Unique (%) | 2.2% |
| Missing (%) | 76.4% |
| Missing (n) | 424 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.022900763 |
|---|---|
| Minimum | 0 |
| Maximum | 15 |
| Zeros (%) | 16.0% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 1 |
| 95-th percentile | 5.5 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range | 1 |
Descriptive statistics
| Standard deviation | 2.358179379 |
|---|---|
| Coef of variation | 2.305384318 |
| Kurtosis | 16.49629146 |
| Mean | 1.022900763 |
| MAD | 1.395839403 |
| Skewness | 3.707068821 |
| Sum | 134 |
| Variance | 5.561009982 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 0 | 89 | 16.0% | |
| 1 | 17 | 3.1% | |
| 2 | 7 | 1.3% | |
| 3 | 6 | 1.1% | |
| 4 | 4 | 0.7% | |
| 7 | 2 | 0.4% | |
| 6 | 2 | 0.4% | |
| 14 | 1 | 0.2% | |
| 9 | 1 | 0.2% | |
| 15 | 1 | 0.2% | |
| (Missing) | 424 | 76.4% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 89 | 16.0% | |
| 1 | 17 | 3.1% | |
| 2 | 7 | 1.3% | |
| 3 | 6 | 1.1% | |
| 4 | 4 | 0.7% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 15 | 1 | 0.2% | |
| 14 | 1 | 0.2% | |
| 9 | 1 | 0.2% | |
| 7 | 2 | 0.4% | |
| 6 | 2 | 0.4% |
qt_alteracao_socio_180d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_alteracao_socio_365d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_alteracao_socio_90d
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_alteracao_socio_total
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
qt_art
Highly correlated
This variable is highly correlated with qt_admitidos and should be ignored for analysis
| Correlation | 1 |
|---|
qt_coligadas
Highly correlated
This variable is highly correlated with min_filiais_coligados and should be ignored for analysis
| Correlation | 0.9682511623 |
|---|
qt_coligados
Highly correlated
This variable is highly correlated with qt_coligadas and should be ignored for analysis
| Correlation | 0.9974819746 |
|---|
qt_coligados_agropecuaria
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 62 |
|---|---|
| 1 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 62 | 11.2% | |
| 1 | 1 | 0.2% | |
| (Missing) | 492 | 88.6% |
qt_coligados_atividade_alto
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_atividade_baixo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_atividade_inativo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_atividade_medio
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_atividade_mt_baixo
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_ativo
Numeric
| Distinct count | 13 |
|---|---|
| Unique (%) | 2.3% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 3.793650794 |
|---|---|
| Minimum | 0 |
| Maximum | 42 |
| Zeros (%) | 0.2% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 2 |
| Q3 | 3 |
| 95-th percentile | 18.9 |
| Maximum | 42 |
| Range | 42 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 7.304790143 |
|---|---|
| Coef of variation | 1.925530456 |
| Kurtosis | 15.98976811 |
| Mean | 3.793650794 |
| MAD | 3.786344167 |
| Skewness | 3.911453634 |
| Sum | 239 |
| Variance | 53.35995904 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 30 | 5.4% | |
| 2 | 15 | 2.7% | |
| 3 | 6 | 1.1% | |
| 5 | 3 | 0.5% | |
| 9 | 2 | 0.4% | |
| 42 | 1 | 0.2% | |
| 0 | 1 | 0.2% | |
| 6 | 1 | 0.2% | |
| 4 | 1 | 0.2% | |
| 20 | 1 | 0.2% | |
| Other values (2) | 2 | 0.4% | |
| (Missing) | 492 | 88.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 1 | 0.2% | |
| 1 | 30 | 5.4% | |
| 2 | 15 | 2.7% | |
| 3 | 6 | 1.1% | |
| 4 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 42 | 1 | 0.2% | |
| 33 | 1 | 0.2% | |
| 23 | 1 | 0.2% | |
| 20 | 1 | 0.2% | |
| 9 | 2 | 0.4% |
qt_coligados_baixada
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_ccivil
Highly correlated
This variable is highly correlated with qt_art and should be ignored for analysis
| Correlation | 1 |
|---|
qt_coligados_centro
Highly correlated
This variable is highly correlated with qt_coligadas and should be ignored for analysis
| Correlation | 0.9142772641 |
|---|
qt_coligados_comercio
Numeric
| Distinct count | 11 |
|---|---|
| Unique (%) | 2.0% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 2.444444444 |
|---|---|
| Minimum | 0 |
| Maximum | 42 |
| Zeros (%) | 3.8% |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| Median | 1 |
| Q3 | 2 |
| 95-th percentile | 8.6 |
| Maximum | 42 |
| Range | 42 |
| Interquartile range | 2 |
Descriptive statistics
| Standard deviation | 6.341816616 |
|---|---|
| Coef of variation | 2.594379525 |
| Kurtosis | 26.42177439 |
| Mean | 2.444444444 |
| MAD | 2.888888889 |
| Skewness | 4.879665149 |
| Sum | 154 |
| Variance | 40.21863799 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 25 | 4.5% | |
| 0 | 21 | 3.8% | |
| 2 | 8 | 1.4% | |
| 5 | 2 | 0.4% | |
| 3 | 2 | 0.4% | |
| 42 | 1 | 0.2% | |
| 4 | 1 | 0.2% | |
| 9 | 1 | 0.2% | |
| 20 | 1 | 0.2% | |
| 22 | 1 | 0.2% | |
| (Missing) | 492 | 88.6% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 0 | 21 | 3.8% | |
| 1 | 25 | 4.5% | |
| 2 | 8 | 1.4% | |
| 3 | 2 | 0.4% | |
| 4 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 42 | 1 | 0.2% | |
| 22 | 1 | 0.2% | |
| 20 | 1 | 0.2% | |
| 9 | 1 | 0.2% | |
| 5 | 2 | 0.4% |
qt_coligados_epp
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_exterior
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_inapta
Boolean
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 62 |
|---|---|
| 1 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 62 | 11.2% | |
| 1 | 1 | 0.2% | |
| (Missing) | 492 | 88.6% |
qt_coligados_industria
Highly correlated
This variable is highly correlated with qt_coligados_ccivil and should be ignored for analysis
| Correlation | 0.9011165757 |
|---|
qt_coligados_ltda
Highly correlated
This variable is highly correlated with qt_coligados_centro and should be ignored for analysis
| Correlation | 1 |
|---|
qt_coligados_matriz
Highly correlated
This variable is highly correlated with qt_coligados and should be ignored for analysis
| Correlation | 0.991544335 |
|---|
qt_coligados_me
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_mei
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_nordeste
Highly correlated
This variable is highly correlated with qt_coligados_ativo and should be ignored for analysis
| Correlation | 0.9006896596 |
|---|
qt_coligados_norte
Highly correlated
This variable is highly correlated with qt_art and should be ignored for analysis
| Correlation | 1 |
|---|
qt_coligados_nula
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_sa
Highly correlated
This variable is highly correlated with qt_coligados_industria and should be ignored for analysis
| Correlation | 0.9802385571 |
|---|
qt_coligados_serviço
Highly correlated
This variable is highly correlated with max_vl_folha_coligados and should be ignored for analysis
| Correlation | 0.9134551877 |
|---|
qt_coligados_sudeste
Highly correlated
This variable is highly correlated with qt_coligados_ltda and should be ignored for analysis
| Correlation | 0.9443641162 |
|---|
qt_coligados_sul
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_coligados_suspensa
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 88.6% |
| Missing (n) | 492 |
| 0 | 63 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| (Missing) | 492 | 88.6% |
qt_desligados
Highly correlated
This variable is highly correlated with qt_art and should be ignored for analysis
| Correlation | 1 |
|---|
qt_desligados_12meses
Highly correlated
This variable is highly correlated with qt_art and should be ignored for analysis
| Correlation | 1 |
|---|
qt_ex_funcionarios
Highly correlated
This variable is highly correlated with qt_desligados and should be ignored for analysis
| Correlation | 0.9999417874 |
|---|
qt_filiais
Highly correlated
This variable is highly correlated with qt_coligados_sa and should be ignored for analysis
| Correlation | 0.9757969605 |
|---|
qt_funcionarios
Highly correlated
This variable is highly correlated with idade_de_49_a_53 and should be ignored for analysis
| Correlation | 0.9326400842 |
|---|
qt_funcionarios_12meses
Highly correlated
This variable is highly correlated with qt_funcionarios and should be ignored for analysis
| Correlation | 0.9757958364 |
|---|
qt_funcionarios_24meses
Highly correlated
This variable is highly correlated with qt_admitidos and should be ignored for analysis
| Correlation | 0.9220046051 |
|---|
qt_funcionarios_coligados
Highly correlated
This variable is highly correlated with qt_filiais and should be ignored for analysis
| Correlation | 0.9495362911 |
|---|
qt_funcionarios_coligados_gp
Highly correlated
This variable is highly correlated with qt_funcionarios_coligados and should be ignored for analysis
| Correlation | 0.971940039 |
|---|
qt_funcionarios_grupo
Highly correlated
This variable is highly correlated with qt_funcionarios_coligados_gp and should be ignored for analysis
| Correlation | 0.9157758137 |
|---|
qt_ramos_coligados
Highly correlated
This variable is highly correlated with qt_funcionarios_grupo and should be ignored for analysis
| Correlation | 0.966877865 |
|---|
qt_regioes_coligados
Highly correlated
This variable is highly correlated with qt_coligadas and should be ignored for analysis
| Correlation | 0.9142772641 |
|---|
qt_socios
Numeric
| Distinct count | 7 |
|---|---|
| Unique (%) | 1.3% |
| Missing (%) | 26.5% |
| Missing (n) | 147 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 1.279411765 |
|---|---|
| Minimum | 1 |
| Maximum | 9 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 0.6541251802 |
|---|---|
| Coef of variation | 0.5112702558 |
| Kurtosis | 50.12274312 |
| Mean | 1.279411765 |
| MAD | 0.4314446367 |
| Skewness | 5.302991985 |
| Sum | 522 |
| Variance | 0.4278797514 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 315 | 56.8% | |
| 2 | 81 | 14.6% | |
| 3 | 9 | 1.6% | |
| 5 | 1 | 0.2% | |
| 9 | 1 | 0.2% | |
| 4 | 1 | 0.2% | |
| (Missing) | 147 | 26.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 315 | 56.8% | |
| 2 | 81 | 14.6% | |
| 3 | 9 | 1.6% | |
| 4 | 1 | 0.2% | |
| 5 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 9 | 1 | 0.2% | |
| 5 | 1 | 0.2% | |
| 4 | 1 | 0.2% | |
| 3 | 9 | 1.6% | |
| 2 | 81 | 14.6% |
qt_socios_coligados
Highly correlated
This variable is highly correlated with qt_coligados_ativo and should be ignored for analysis
| Correlation | 0.9824935146 |
|---|
qt_socios_feminino
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 65.8% |
| Missing (n) | 365 |
| 1 | |
|---|---|
| 2 | 11 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 179 | 32.3% | |
| 2 | 11 | 2.0% | |
| (Missing) | 365 | 65.8% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_socios_masculino
Highly correlated
This variable is highly correlated with qt_art and should be ignored for analysis
| Correlation | 1 |
|---|
qt_socios_pep
Highly correlated
This variable is highly correlated with qt_socios_masculino and should be ignored for analysis
| Correlation | 1 |
|---|
qt_socios_pf
Highly correlated
This variable is highly correlated with qt_socios_pep and should be ignored for analysis
| Correlation | 1 |
|---|
qt_socios_pj
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.7% |
| Missing (%) | 26.5% |
| Missing (n) | 147 |
| 0 | |
|---|---|
| 2 | 1 |
| 1 | 1 |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 406 | 73.2% | |
| 2 | 1 | 0.2% | |
| 1 | 1 | 0.2% | |
| (Missing) | 147 | 26.5% |
| Max length | 3 |
|---|---|
| Mean length | 3 |
| Min length | 3 |
| Contains chars | True |
| Contains digits | True |
| Contains spaces | False |
| Contains non-words | True |
qt_socios_pj_ativos
Highly correlated
This variable is highly correlated with qt_socios_pj and should be ignored for analysis
| Correlation | 1 |
|---|
qt_socios_pj_baixados
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 99.6% |
| Missing (n) | 553 |
| 0 | 2 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.4% | |
| (Missing) | 553 | 99.6% |
qt_socios_pj_inaptos
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 99.6% |
| Missing (n) | 553 |
| 0 | 2 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.4% | |
| (Missing) | 553 | 99.6% |
qt_socios_pj_nulos
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 99.6% |
| Missing (n) | 553 |
| 0 | 2 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.4% | |
| (Missing) | 553 | 99.6% |
qt_socios_pj_suspensos
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 99.6% |
| Missing (n) | 553 |
| 0 | 2 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.4% | |
| (Missing) | 553 | 99.6% |
qt_socios_st_regular
Highly correlated
This variable is highly correlated with qt_socios_pep and should be ignored for analysis
| Correlation | 1 |
|---|
qt_socios_st_suspensa
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.4% |
| Missing (%) | 98.7% |
| Missing (n) | 548 |
| 1 | 7 |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 7 | 1.3% | |
| (Missing) | 548 | 98.7% |
qt_ufs_coligados
Highly correlated
This variable is highly correlated with qt_socios_pj_ativos and should be ignored for analysis
| Correlation | 1 |
|---|
setor
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.5% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| COMERCIO | |
|---|---|
| INDUSTRIA | 52 |
| SERVIÇO | 7 |
| Value | Count | Frequency (%) | |
| COMERCIO | 496 | 89.4% | |
| INDUSTRIA | 52 | 9.4% | |
| SERVIÇO | 7 | 1.3% |
| Max length | 9 |
|---|---|
| Mean length | 8.081081081 |
| Min length | 7 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
sg_uf
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | 1.1% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| MA | |
|---|---|
| RN | |
| PI | |
| Other values (3) |
| Value | Count | Frequency (%) | |
| MA | 157 | 28.3% | |
| RN | 116 | 20.9% | |
| PI | 103 | 18.6% | |
| AM | 94 | 16.9% | |
| RO | 64 | 11.5% | |
| AC | 21 | 3.8% |
| Max length | 2 |
|---|---|
| Mean length | 2 |
| Min length | 2 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
sg_uf_matriz
Categorical
| Distinct count | 7 |
|---|---|
| Unique (%) | 1.3% |
| Missing (%) | 0.0% |
| Missing (n) | 0 |
| MA | |
|---|---|
| RN | |
| PI | |
| Other values (4) |
| Value | Count | Frequency (%) | |
| MA | 156 | 28.1% | |
| RN | 116 | 20.9% | |
| PI | 103 | 18.6% | |
| AM | 93 | 16.8% | |
| RO | 64 | 11.5% | |
| AC | 22 | 4.0% | |
| CE | 1 | 0.2% |
| Max length | 2 |
|---|---|
| Mean length | 2 |
| Min length | 2 |
| Contains chars | True |
| Contains digits | False |
| Contains spaces | False |
| Contains non-words | False |
sum_faturamento_estimado_coligadas
Highly correlated
This variable is highly correlated with qt_regioes_coligados and should be ignored for analysis
| Correlation | 0.9794843167 |
|---|
total
Highly correlated
This variable is highly correlated with qt_funcionarios_12meses and should be ignored for analysis
| Correlation | 0.9743791866 |
|---|
total_filiais_coligados
Numeric
| Distinct count | 12 |
|---|---|
| Unique (%) | 2.2% |
| Missing (%) | 97.3% |
| Missing (n) | 540 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | 14.53333333 |
|---|---|
| Minimum | 1 |
| Maximum | 46 |
| Zeros (%) | 0.0% |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| Median | 5 |
| Q3 | 25.5 |
| 95-th percentile | 41.8 |
| Maximum | 46 |
| Range | 45 |
| Interquartile range | 24.5 |
Descriptive statistics
| Standard deviation | 16.32205636 |
|---|---|
| Coef of variation | 1.123077272 |
| Kurtosis | -0.7883560931 |
| Mean | 14.53333333 |
| MAD | 13.70666667 |
| Skewness | 0.8762728433 |
| Sum | 218 |
| Variance | 266.4095238 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 1 | 5 | 0.9% | |
| 36 | 1 | 0.2% | |
| 46 | 1 | 0.2% | |
| 2 | 1 | 0.2% | |
| 5 | 1 | 0.2% | |
| 32 | 1 | 0.2% | |
| 19 | 1 | 0.2% | |
| 13 | 1 | 0.2% | |
| 3 | 1 | 0.2% | |
| 40 | 1 | 0.2% | |
| (Missing) | 540 | 97.3% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| 1 | 5 | 0.9% | |
| 2 | 1 | 0.2% | |
| 3 | 1 | 0.2% | |
| 5 | 1 | 0.2% | |
| 13 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 46 | 1 | 0.2% | |
| 40 | 1 | 0.2% | |
| 36 | 1 | 0.2% | |
| 32 | 1 | 0.2% | |
| 19 | 1 | 0.2% |
tx_crescimento_12meses
Numeric
| Distinct count | 26 |
|---|---|
| Unique (%) | 4.7% |
| Missing (%) | 82.5% |
| Missing (n) | 458 |
| Infinite (%) | 0.0% |
| Infinite (n) | 0 |
| Mean | -0.13232472 |
|---|---|
| Minimum | -100 |
| Maximum | 200 |
| Zeros (%) | 11.4% |
Quantile statistics
| Minimum | -100 |
|---|---|
| 5-th percentile | -66.9047619 |
| Q1 | 0 |
| Median | 0 |
| Q3 | 0 |
| 95-th percentile | 34 |
| Maximum | 200 |
| Range | 300 |
| Interquartile range | 0 |
Descriptive statistics
| Standard deviation | 40.32431378 |
|---|---|
| Coef of variation | -304.7375713 |
| Kurtosis | 12.55770928 |
| Mean | -0.13232472 |
| MAD | 16.53372542 |
| Skewness | 2.337460584 |
| Sum | -12.83549784 |
| Variance | 1626.050282 |
| Memory size | 4.4 KiB |
| Value | Count | Frequency (%) | |
| 0 | 63 | 11.4% | |
| -50 | 3 | 0.5% | |
| -16.66666667 | 3 | 0.5% | |
| -6.666666667 | 3 | 0.5% | |
| -100 | 2 | 0.4% | |
| 14.28571429 | 2 | 0.4% | |
| 100 | 2 | 0.4% | |
| 200 | 2 | 0.4% | |
| -33.33333333 | 1 | 0.2% | |
| -71.42857143 | 1 | 0.2% | |
| Other values (15) | 15 | 2.7% | |
| (Missing) | 458 | 82.5% |
Minimum 5 values
| Value | Count | Frequency (%) | |
| -100 | 2 | 0.4% | |
| -72.72727273 | 1 | 0.2% | |
| -71.42857143 | 1 | 0.2% | |
| -67.85714286 | 1 | 0.2% | |
| -66.66666667 | 1 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) | |
| 200 | 2 | 0.4% | |
| 100 | 2 | 0.4% | |
| 50 | 1 | 0.2% | |
| 30 | 1 | 0.2% | |
| 25 | 1 | 0.2% |
tx_crescimento_24meses
Highly correlated
This variable is highly correlated with min_vl_folha_coligados_gp and should be ignored for analysis
| Correlation | 0.9675760839 |
|---|
tx_rotatividade
Highly correlated
This variable is highly correlated with qt_art and should be ignored for analysis
| Correlation | 1 |
|---|
Unnamed_0
Highly correlated
This variable is highly correlated with qt_socios_pj_ativos and should be ignored for analysis
| Correlation | 1 |
|---|
vl_faturamento_estimado_aux
Highly correlated
This variable is highly correlated with qt_socios_pj_ativos and should be ignored for analysis
| Correlation | 1 |
|---|
vl_faturamento_estimado_grupo_aux
Highly correlated
This variable is highly correlated with qt_socios_pj_ativos and should be ignored for analysis
| Correlation | 1 |
|---|
vl_folha_coligados
Highly correlated
This variable is highly correlated with vl_faturamento_estimado_grupo_aux and should be ignored for analysis
| Correlation | 0.9801776077 |
|---|
vl_folha_coligados_gp
Highly correlated
This variable is highly correlated with vl_folha_coligados and should be ignored for analysis
| Correlation | 0.9769040844 |
|---|
vl_frota
Highly correlated
This variable is highly correlated with total_filiais_coligados and should be ignored for analysis
| Correlation | 1 |
|---|
vl_idade_maxima_socios_pj
Highly correlated
This variable is highly correlated with vl_faturamento_estimado_grupo_aux and should be ignored for analysis
| Correlation | 1 |
|---|
vl_idade_media_socios_pj
Highly correlated
This variable is highly correlated with nu_meses_rescencia and should be ignored for analysis
| Correlation | 1 |
|---|
vl_idade_minima_socios_pj
Highly correlated
This variable is highly correlated with vl_idade_media_socios_pj and should be ignored for analysis
| Correlation | 1 |
|---|
vl_potenc_cons_oleo_gas
Highly correlated
This variable is highly correlated with tx_crescimento_24meses and should be ignored for analysis
| Correlation | 1 |
|---|
vl_total_tancagem
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
vl_total_tancagem_grupo
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
vl_total_veiculos_antt
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
vl_total_veiculos_antt_grupo
Constant
This variable is constant and should be ignored for analysis
| Constant value | nan |
|---|
vl_total_veiculos_leves
Highly correlated
This variable is highly correlated with total_filiais_coligados and should be ignored for analysis
| Correlation | 1 |
|---|
vl_total_veiculos_leves_grupo
Highly correlated
This variable is highly correlated with vl_idade_maxima_socios_pj and should be ignored for analysis
| Correlation | 1 |
|---|
vl_total_veiculos_pesados
Highly correlated
This variable is highly correlated with vl_potenc_cons_oleo_gas and should be ignored for analysis
| Correlation | 1 |
|---|
vl_total_veiculos_pesados_grupo
Highly correlated
This variable is highly correlated with vl_total_veiculos_leves_grupo and should be ignored for analysis
| Correlation | 0.9945105446 |
|---|